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Abstract — Point-to-point  links  using  multiple  transmitters  and 
receivers  or  MIMO  (multiple  input,  multiple  output)  configura¬ 
tions  and  associated  signal  processing  at  the  receiver,  can  provide 
significant  improvements  in  both  data  rate  and  reliability  and 
is  a  promising  technology  for  enhancing  communications  in  the 
band-limited  and  highly-dynamic  underwater  acoustic  (UWA) 
channel.  Although  the  underwater  channel  impulse  response 
extent  can  span  tens  to  hundreds  of  symbols,  it  is  generally 
sparse  in  nature  and  well  suited  for  sparse  partial  response 
equalization  (sPRE).  This  equalization  scheme,  which  is  not 
restricted  to  MIMO  configurations,  does  not  attempt  to  suppress 
intersymbol  interference  completely,  rather  it  retains  residual 
ISI  in  a  controlled  manner.  This  is  accomplished  by  setting 
a  sparse  residual  impulse  response  target  generally  similar  in 
magnitude  and  time  to  the  dominant,  yet  also  sparse,  arrivals 
within  the  actual  channel  impulse  response.  The  resultant 
partial  response  equalizer  is  followed  by  a  complexity-reduction 
detection  scheme  known  as  “belief  propagation  (BP)”  which  is 
an  alternative  to  the  optimal  Viterbi  or  MAP  (maximum  a- 
posteriori  probability)  detector.  The  complexity  of  the  optimal 
schemes  grows  exponentially  with  the  total  number  of  taps, 
regardless  of  structure;  whereas  the  complexity  of  BP  grows 
exponentially  only  with  the  non-zero  taps.  Thus  the  entire 
receiver  structure,  sPRE  followed  by  BP,  is  suitable  for  the 
long,  sparse  channels  since  it  allows  more  efficient  exploitation 
of  the  channel  structure.  The  proposed  symbol  recovery  scheme 
was  applied  to  data  collected  during  a  comprehensive  multi¬ 
institution  MIMO  Experiment  conducted  within  the  Makai 
Experiment  in  2005  off  the  northwest  coast  of  Kauai,  Hawaii. 
We  have  demonstrated  a  reduction  in  error  rates  over  receiver 
algorithms  using  conventional  decision  feedback  equalization 
techniques  due  to  increased  multipath  diversity. 

I.  Introduction 

Iterative  cancellation  of  intersymbol  interference  (ISI),  also 
known  as  turbo  equalization  (TE)  for  coded  systems  is  pro¬ 
posed  in  [1],  In  this  scheme  a  soft  input,  soft  output  channel 
detector  (usually  the  maximum  a-posteriori  probability  (MAP) 
detector)  is  used  to  detect  the  channel  symbols  which  are  then 
decoded  using  another  MAP  decoder  for  the  error  correction 
code.  The  MAP  detector  and  the  MAP  decoder  then  exchange 
soft  (probabilistic)  information  iteratively  to  reduce  the  error 
rates  gradually  and  obtain  significantly  improved  performance 
over  traditional  non-iterative  algorithms.  However,  despite  its 
superior  performance,  the  use  of  TE  is  only  limited  to  ISI 
channels  with  short  memory  since  the  complexity  of  the  MAP 


detector  increases  exponentially  with  the  channel  memory, 
making  it  extremely  complex  (computationally)  for  longer  ISI 
channels. 

An  interesting  performance-complexity  tradeoff  for  chan¬ 
nels  with  long  memory  can  be  obtained  by  using  a  partial 
response  equalizer  (PRE)  wherein  instead  of  equalizing  the 
channel  completely  as  is  done  by  a  linear  equalizer  or  a 
decision  feedback  equalizer  (DFE),  it  is  equalized  to  a  target 
impulse  response  (TIR).  In  other  words  instead  of  removing 
the  ISI  completely,  the  channel  energy  is  concentrated  into  a 
smaller  number  of  taps  without  enhancing  the  noise  signifi¬ 
cantly.  The  combination  of  the  original  ISI  channel  and  the 
PRE  then  resembles  an  ISI  channel  with  a  shorter  memory. 
The  PRE  is  typically  followed  by  the  TE  scheme  which  can 
now  be  readily  implemented  due  to  the  smaller  memory  of 
the  “shortened”  channel. 

A  reduced  complexity  turbo  equalization  scheme  for  mul¬ 
tiple  input  multiple  output  (MIMO)  ISI  channels  has  been 
proposed  in  [2],  where  the  authors  have  designed  a  MIMO 
PRE  based  on  the  MMSE  criterion.  The  target  impulse 
response  is  also  optimized  by  maximizing  the  signal  to  noise 
ratio  (SNR)  at  the  PRE  output  under  certain  constraints 
on  the  target  tap  coefficients.  However,  there  are  two  key 
assumptions  in  the  work  of  [2]  and  they  are:  (1)  the  MIMO  ISI 
channel  is  quasi-static  fading,  that  is  the  channel  is  invariant 
during  the  transmission  of  a  code  word  (frame)  and  changes 
independently  from  one  frame  to  the  next,  and  (2)  the  channel 
is  known  perfectly  at  the  receiver.  There  is  also  an  implicit 
assumption  that  TIR  energy  is  always  concentrated  in  adjacent 
taps,  irrespective  of  the  original  channel  impulse  response 
structure. 

While  these  assumptions  are  reasonable  for  terrestrial  wire¬ 
less  environments,  they  may  not  apply  for  certain  situations, 
where  the  channel  is  rapidly  varying,  such  as  the  underwater 
acoustic  (UWA)  communication  channel.  For  such  channels 
accurate  instantaneous  channel  estimation  or  tracking  is  a 
computationally  intensive  and  difficult  task  and  adaptive 
equalization  solutions  are  often  preferred.  Moreover,  the  im¬ 
pulse  response  for  the  UWA  channel  is  often  sparse  (arrivals 
or  taps  with  significant  energies  are  well  separated  in  time), 
which  renders  a  dense  TIR  suboptimal. 
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Fig.  1.  Iterative  sPRE  +  BP  structure. 


In  this  paper  we  develop  a  sparse  partial  response  equalizer 
(sPRE)  using  the  RLS  algorithm.  Unlike  the  structure  of 
[2],  the  proposed  structure  is  training  based,  fully  adaptive, 
and  does  not  require  accurate  estimates  of  the  UWA  channel 
impulse  response.  This  adaptive  sPRE  also  adapts  the  TIR 
tap  magnitudes  using  the  maximum-SNR  criterion  [2],  Most 
importantly,  the  sPRE  is  designed  to  have  an  additional  degree 
of  freedom,  allowing  control  over  the  TIR  tap  placement, 
without  forcing  them  to  be  adjacent.  This  is  particularly 
useful  for  shallow  water  acoustic  channels  where  the  impulse 
response  is  long,  yet  sparse.  For  such  channels,  the  sPRE 
TIR  taps  can  be  chosen  to  be  sparse  and  made  to  coincide 
(in  position)  with  those  of  the  original  channel.  This  allows 
better  exploitation  of  the  multipath  diversity  inherent  in  ISI 
channels  by  retaining  energies  in  the  significant  channel  taps 
as  opposed  to  a  DFE  which  treats  the  secondary  arrivals  as 
interference  and  suppresses  them.  The  sPRE  block  is  followed 
by  a  low  complexity  channel  detection  scheme  known  a  belief 
propagation  (BP)  [3]  with  exponential  complexity  only  in  the 
number  of  non-zero  sPRE  TIR  taps.  For  coded  transmission, 
the  sPRE+BP  structure  is  followed  by  a  soft  input,  soft  output 
channel  decoder  which  exchanges  probabilistic  information 
with  the  sPRE+BP  block  (turbo  equalization)  and  achieves 
significant  improvements  in  error  rates.  Finally,  the  UWA 
channel  suffers  from  significant  doppler  shift  which,  if  not 
compensated  for,  causes  the  adaptive  equalizer  to  diverge 
[4],  Hence,  to  make  the  adaptive  sPRE  design  applicable  to 
real  UWA  channels,  a  phase  locked  loop  (PLL)  working  in 
conjunction  with  the  sPRE  has  been  developed.  The  joint 
sPRE-PLL  loop  adaptively  tracks  the  channel  phase  and 
updates  the  equalizer  coefficient  similar  to  the  DFE-PLL 
proposed  in  [4], 

The  rest  of  the  paper  is  organized  as  follows.  Section 
II  describes  the  transmission  scheme  and  the  discrete  time 
channel  model.  In  Section  III  the  sparse  PRE  algorithm  details 
along  with  those  of  the  the  embedded  PLL  are  derived. 
Section  IV  provides  numerical  results  obtained  by  monte- 
carlo  simulations  along  with  those  obtained  by  using  real  at- 
sea  data.  Finally  the  paper  concludes  in  Section  V. 

II.  Transmission  Scheme  and  Discrete  Time 
Channel  Model 

Consider  a  MIMO  system  with  tit  transmitters  and  hr 
receivers  operating  over  an  ISI  channel  with  D  +  1  symbol 


spaced  taps.  At  the  transmitter,  the  incoming  bit  stream  is 
spatially  multiplexed  to  form  tit  sub-streams.  Each  sub¬ 
stream  is  encoded  using  a  turbo  code  and  interleaved  using 
a  channel  interleaver.  Each  interleaved  sub-stream  is  then 
modulated,  pulse  shaped,  and  transmitted  simultaneously  over 
the  nr  transmitters.  The  discrete-time  baseband  equivalent  of 
the  signal  received  at  the  jth  receive  antenna,  after  low  pass 
filtering  and  sampling  can  be  expressed  by  a  tapped  delay 
line  model  as 


tit 

Vj(n)  =  £ 


k= 1 


'  D 

J2hkj(n)dk{n-l) 


u=o 


g tfkj  (n) 


Vj(n) 


(1) 


where  h^j  (n)  is  the  Ith  tap  of  the  D  +  l  tap  impulse  response 
vector  from  the  kth  transmitter  to  the  jth  receiver  at  time  n, 
and  dk(n)  is  the  symbol  transmitted  from  the  kth  transmit 
antenna  at  time  n.  The  term  (n)  captures  the  small  scale 
time  variations  of  the  channel  whereas  the  phase  rotation  due 
to  doppler  shift  is  treated  separately  using  dkj  [4], 


III.  Sparse  Iterative  PRE  with  Embedded  PLL 

This  section  develops  the  training  based  adaptive,  sPRE- 
PLL  using  the  RLS  algorithm.  As  explained  earlier,  the 
motivation  behind  using  a  sPRE  is  to  exploit  the  multipath 
diversity  inherent  in  ISI  channels  by  retaining  controlled 
residual  ISI  at  the  output  of  the  equalizer,  and  to  minimize 
the  TE  complexity  [1],  The  residual  ISI  structure  is  optimized, 
both  in  magnitude  and  tap  positions,  to  resemble  the  original 
channel  and  is  further  mitigated  by  using  a  near  optimal  (low 
complexity)  graph-based  detection  algorithm  known  as  belief 
propagation  (BP)  algorithm.  For  channel  coded  systems,  the 
sPRE  structure  is  designed  as  a  soft  input,  soft  output  module, 
allowing  us  to  perform  iterative  (turbo)  equalization  [1], 


A.  Design  of  sPRE  Filter  Coefficients 

Consider  the  discrete-time  system  model  of  equation  1 .  The 
received  signal  vector  at  each  time  is  formed  by  stacking  the 
received  samples  from  each  receive  antennas  as 

z(n)  =  [ui (n)  v2{n)  ...  vnR(n)]T  (2) 

and  the  transmitted  symbol  vector  at  time  n  is  denoted  by 

s(n)  =  [di(n)  d2(n)  ...  dnT(n)]r  ■  (3) 

The  design  goal  is  two  fold:  1)  shorten  the  original  dense 
channel  of  length  D+l  to  a  shorter  sparse  channel  of  memory 
Ds+ 1  <<  D+l  (partial  response  equalization);  and  2)  obtain 
estimates  of  the  channel  phase  dkj  from  the  kth  transmitter  to 
the  jth  receiver  and  compensate  the  channel  phase  explicitly 
(phase  locked  loop). 

The  adaptive  sPRE  consists  of  a  bank  of  n  rs  finite  impulse 
response  (FIR)  filters  as  shown  in  Figure  2.  The  function  of 
each  filter  is  to  collapse  the  original  channel  of  memory  D 
into  a  much  shorter  channel  of  memory  DS  +  1<<D  +  1. 
Moreover  in  the  sPRE  structure  we  have  the  flexibility  of 
choosing  the  number  of  equalizer  outputs  as  hrs  >  ut  [2], 
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For  our  application  we  will  typically  choose  hrs  =  Lnr, 
L  >  1.  Thus,  corresponding  to  each  transmitter  we  have  L 
sPRE  outputs. 

Each  FIR  filter  of  the  sPRE  consists  of  a  non-causal  part  of 
length  A?|  and  a  causal  part  of  length  N2.  Using  the  received 
signal  vector  z  (n)  in  equation  2  the  equalizer  input  vector 
y (n)  is  formed  as 

y(n)  =  [zT(n  + iVi)  ...  z T(n)  ...  zT[n-N1)]T  (4) 

where  N  =  Ni  +  _ZV2  +  1.  Suppose  Oij  is  the  phase  estimate 
of  the  link  from  the  ith  transmitter  to  the  jth  receiver  at  time 
n.  Using  this  information,  the  phase  compensator  matrix  for 
the  ith  transmit  antenna  is  formed  as 


—y{nT,  1) 
-y{nT,L) 


Fig.  2.  Adaptive  sPRE  structure. 


A  i{n)  =  diag 


o-jOi. 


, . .  e 


The  phase  compensated  signal  vector  for  the  ith 
is  then  given  by 


Vj(n)  =  Aj(?i)y(?r),  1  <i<nT- 


•(5) 


transmitter 


(6) 


In  order  to  apply  the  RLS  algorithm  to  compute  w „(*,  l)  and 
h n\i,l),  we  define  the  error  signal  at  the  ( i,l)th  output  of 
the  sPRE  as 


en(i,l)  =  h  (ns\i,l)x.iis)(n) -yn(i,l) 

=  h^(i,l)x\s\n) -wn(i,l)vi(n)  (10) 


As  explained  earlier,  each  transmit  antenna  has  L  FIR  filters. 
Denoting  the  Ith  filter  for  the  ith  transmit  antenna  as  w (i,  l), 
the  corresponding  sPRE  output  is  given  by 


and  form  the  error  vector  as  e„ 

[(^n  (1  ?  1)5**'  5  (1;  A))  *  *  *  (cn  (riT  1  1)  5  >  &n  (jlT  j  A))] 

The  cost  function  to  be  minimized  is  defined  as 


yn(i,  l )  =  w n(i,  l)vi(n)-  (7) 

The  aim  of  the  sPRE  design  can  now  be  more  formally  stated 
as 

•  adaptively  compute  w n(i,l)  such  that  the  equalizer  out¬ 
put  vector  yn(i,l)  is  equalized  to  a  TIR; 

•  optimally  design  the  TIR. 

The  shortened  TIR  vectors  are  denoted  as  l)  (row 

vector  of  length  Ds  +  1)  which  represents  the  residual  ISI 
at  the  Ith  output  of  the  ith  transmitter.  Moreover,  if  the  UWA 
channel  has  a  sparse  structure  (significant  energy  only  in  a 
few  well-separated  taps),  we  propose  to  design  the  TIR  to 
be  sparse  as  well,  with  the  non-zero  taps  made  to  coincide 
with  the  dominant  taps  of  the  original  channel.  Note  that  an 
accurate  estimate  of  the  original  channel  is  not  needed  for 
this  purpose.  Only  a  crude  measurement  of  the  power-delay 
profile,  obtained  by  channel  probing  suffices  to  choose  the 
delays  in  the  TIR.  Let  the  position  of  the  non-zeros  TIR  taps 
be  denoted  by  the  vector  [to  t±  ■  ■  ■  tds]  (he-,  the  ith  entry  of 
the  TIR  vector  corresponds  to  the  delay  r,).  Thus  the  output 
of  the  sPRE  corresponding  to  the  ith  transmitter  consist  of 
residual  ISI  from  the  symbols  d,(n  —  r0),  di(n  —  T\)  up  to 
di(n  -  td J. 

Let  us  denote  the  shortened  data  vector  for  transmitter  i  as 

xiS)(n)  =  [d»(ra-To),-"  ,  di(n  -  tDs)]t  ■  (8) 

Thus  the  sPRE  output  for  the  ith  transmitter  has  residual 
ISI  from  the  symbols  contained  in  x,-'s|  (n).  Using  the  above 
notations,  the  output  of  the  sPRE  can  be  written  as 


£(«)  =  5ZA"  kekek-  (11) 

k= 1 

The  optimum  solution  in  the  least  squares  sense  is  given  by 

l)  =  <f>r1(n)  9i(n)  l<i<riT,l<l<L  (12) 

where 


^i(n) 

n 

=  ]TA"-fev4(fc)vf(fc) 

(13) 

k= 1 

9i(n) 

=  ”-fev4(fc)x[s)H(fc) 

k= 1 

(14) 

are  the  (time  averaged)  output  correlation  matrix  and  the 
input-output  cross  correlation  matrix  respectively.  Denoting 
(n)  =  P;(?i)  for  simplicity,  we  have 

W =  P i(n)  9i(n)  h n)H(i,l) 

=  c i(n)  h£s^(«,  l )  1  <  i  <  nr,  1  <  l  <  L  (15) 

where  c,;(n)  =  P,  (n)  9i(n).  The  recursion  for  is  given 
by 

Ci{n)  =  c i(n  -  1)  +  Kt(n)af  (n),  (16) 


where  a,(n)  is  the  “a-priori”  error  vector  [5]  and  Kj(?i)  is 
the  RLS  gain  defined  as 


a.i(n)  =  x-s,(n)  -  cf  (?r  -  l)vj(n) 

K,(n)  =  p,(»-  1)vi(") 


A i  +  vf  (n)P i(n  -  l)vi(n) 


(17) 

(18) 


Using  equations  13,  14,  and  the  matrix  inversion  lemma  [5], 
the  inverse  correlation  matrix  P,  (n)  can  be  updated  as 


Vn(i,l)  =  hnS)(LOx!s)W  +rf{n)- 


(9)  Pi(n)  =  [I  -  K i(n)vf  (n)]  P4(n  -  l)/K-  (19) 
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The  shortened  data  vector  x^(n)  is  known  during  the 
training  mode  and  is  replaced  with 


where  h|,5* (i)  and  y n(i)  are  defined  in  equation  21.  The 
covariance  matrix  of  en(i)  is  given  by 


x -s) (n)  =  argmin  -  yn(i)||2  (20) 

X 

where 

H<s)(»)  =  [hW(i,l)T,...  ,hM(i,L)T]T, 

y  n(i)  =  [yn(i, !),■■■  ,lln(i,L)}T  (21) 


during  the  decision  directed  mode.  The  proposed  equalizer  is 
thus  versatile  since  it  can  always  be  trained  to  retain  ISI  of 
any  desired  structure  at  the  equalizer  output. 

1)  Soft  Input,  Soft  Output  sPRE  Module:  For  coded 
systems,  the  non-adaptive  partial  response  equalizer  proposed 
in  [2]  is  typically  followed  by  the  well  known  turbo  equal¬ 
ization  (TE)  scheme  [1],  The  PRE  itself  however  operates 
only  once  and  is  not  a  part  of  the  TE  process.  But  for  our 
application  where  the  partial  response  equalization  is  per¬ 
formed  adaptively,  the  convergence  of  the  iterative  algorithm 
can  be  significantly  improved  if  the  sPRE  structure  is  able  to 
utilize  soft  information.  With  this  motivation,  the  vector  sheer 
operation  of  equation  20  is  modified,  enabling  it  to  process 
additional  soft  information. 

At  every  time  instant,  the  vector  sheer  of  equation  20 
provides  an  estimate  of  the  shortened  data  vector  yif\n)  as 
defined  in  equation  8.  For  simplicity  let  us  assume  that  BPSK 
modulation  is  used.  Thus  the  entries  of  x,-^(n)  consists  of 
equiprobable  ±1  symbols.  Note  that  the  proposed  technique 
is  quite  general  and  can  be  easily  extended  to  any  M— ary 
modulation  scheme.  Also,  for  notational  convenience  let  us 
denote  the  kth  entry  of  x^(n)  as  xk-  In  the  presence  of  an 
outer  decoder,  the  a-priori  information  about  Xk  is  available 
as  La(xk)  =  log [p(xk  =  +l)/p{xk  =  -1)]. 

Using  this  additional  information  the  operation  of  the 
vector  sheer  (equation  20)  can  be  modified  as  follows.  The 
log  likelihood  ratio  (extrinsic  information)  of  xk  is  computed 
as 


Ee{Xk) 


i  P{y  nif)\xk  =  +1) 

gp(yn(*)|^  =  -D 


R  i  =  E 


en(i)e„(i)J 


-hW 


RW(0 


R  (s)  (s)  R.  (s)R'vjVjR'  (s) 


His)(i)"(23) 


where  the  ensemble  averages  R-x(»)x(,)>  RVxfs)(n)’  anc* 
RViVi  (n)  are  defined  as 


RviVi  (tr) 

R  (s)  (n) 

ViXj  ’  v  ' 


E 

E 

E 


=  Iz?„+i 


vi(n)vii/(?r)] 

Vi(n)x;(s)H(n) 


(24) 

(25) 

(26) 


Note  that,  while  Rv.v.  (?i)  and  Rv.x<»(n)>  in  general,  are 
different  from  the  time  averaged  quantities  4?,(n)  and  Gfn) 
(equations  13,  14),  R^^.(n)Rv  xw  (n)  can  be  well  approx¬ 
imated  by  (n)O-fn),  if  we  assume  that  the  transmitted 
data  vectors  evolve  as  an  ergodic  process.  Thus  we  write 


r>  — 1 


(27) 


Equation  23  can  then  be  written  as 


R  =  hW(») 


I  —  R 


H 


A*)' 


H 


The  recursion  for  R  (S)  can  be  shown  to  be 

ViX>  ' 

RViX(S)(n)  =  E  [v;(n)x(s)H(n)] 

=  r-^~X  RVixfs)  (n  -  !)  +  ^Vi(n)x(s)H(n)- 

is  then  chosen  by  maximizing  trace  [R^  1].  This 
optimization  can  be  achieved  by  choosing  the  rows  of  H„  ^  (i) 
to  be  the  eigenvectors  of  R,;  which  diagonalizes  R,  .  Thus 
we  compute 


h^M=U 


(28) 


=  log 


E  exP 

Xk=  + 1 

||y„(i)-H[1s)(i)x(s)(n)||2  |  ^  XjLa(  Xj) 

1"  E,  2 

iitk 

E  exp 

xk  =  -l 

||y„(i)-Hjf)(i)xP)(n)||2  (  ^  XjLa  (xj) 

2a2  l"  E,  2 

where  U/  is  the  eigenvector  corresponding  to  the  Ith  largest 
eigenvalue  of  R, .  Finally,  we  substitute  equations  28  and  16 
in  equation  15  to  obtain  the  optimum  sPRE  coefficient  vector 
w 


After  computing  the  extrinsic  information  for  \/xk, 
the  hard  symbol  decision  for  xk  is  made  as  xk  = 
sign  [Le(xk)  +  La(xk)\  to  form  the  shortened  data  vector 
x\s\n).  Note  that  without  the  a-priori  information  ( La(xk )  = 
0)  the  above  procedure  reduces  to  the  simple  vector  sheer  of 
equation  20. 

B.  Design  of  Optimal  Target  Impulse  Response 

We  now  compute  the  optimal  TIR  vectors  h[f  ')  (i.  1).  Using 
the  notation  of  equation  10,  the  error  vector  for  transmitter  i 
is  defined  as 

e„(i)  =  [en(i,  !),•••  ,en{i,L)]T  =  H^)(i)x,[s)(n)  -y  „(i)  C 


C.  Design  of  PLL 

Recall  from  equation  10  that  the  error  at  the  (■ i ,  l)th  sPRE 
output  is  given  by 

en(i,l)  =  -wn(i,l)[Ai(n)y(n)] 

where  A fn)  is  defined  in  equation  5.  The  channel  phase 
estimate  0y(n),  (1  <  i  <  nr,  1  <  j  <  tir)  is  obtained 
by  minimizing  the  cost  function 

TlT  L 

ee  =  E[eH (n)e(n)]  =  ^  ^2  E\en(i,  l)\2-  (29) 

1  2=1  1=1 
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Fig.  3.  Breakdown  of  each  PRE  block. 


The  gradient  of  eg  can  be  written  as 


deg 

dOy 


L 


1=1 


(30) 


where  =  J/(*>0  as  shown  in  Figure  3.  The 

phase  update  is  then  computed  as  (similar  to  [4]) 


6ij(n  +  1)  =  %(n)  +  Kfif3ij(n)  +  I<f2  ^  (31) 

m= 0 


where  /3ij(n)  is  the  instantaneous  gradient  estimate  given  by 


IV.  Results 

A.  Simulation  Results 

Numerical  results  based  on  monte-carlo  simulations  are 
provided  next.  We  compare  the  performance  of  the  iterative 
DFE  [6]  and  its  MIMO  version  [7],  [8]  with  the  proposed 
sPRE,  under  identical  conditions.  First  we  consider  the  single 
input,  single  output  (SISO)  case  where  at  the  transmitter, 
the  incoming  bit  stream  is  encoded  using  a  rate  1/2  turbo 
code  with  constituent  codes  having  generator  polynomials  as 
[1, 5/7]  octa|  -  The  resultant  codeword  is  then  interleaved  and 
BPSK  modulated.  We  consider  the  quasi-static  fading  channel 
case  (where  the  channel  assumes  a  particular  realization  for 
a  block  of  symbols  and  changes  independently  in  the  next 
block).  The  block  length  is  chosen  to  be  1000  symbols,  which 
is  approximately  equivalent  to  the  channel  coherence  time 
observed  using  real  data.  Moreover,  to  accurately  simulate  a 
shallow  water  acoustic  channel,  the  original  channel  impulse 
response  is  chosen  to  be  sparse  and  spans  25  symbols  as 
shown  in  the  sub-figure  of  Figure  4.  At  the  receiver  the  sPRE 
is  implemented  with  a  2  tap  TIR,  and  the  taps  are  positioned  at 
symbols  To  =  1  and  t\  =  21  to  approximately  coincide  with 
the  dominant  arrivals  of  the  original  channel.  The  simulated 
frame  error  rate  (FER)  is  shown  in  Figure  4.  Note  that  with 
optimally  chosen  tap  positions  the  sPRE  provides  significant 
performance  improvement  over  the  iterative  DFE.  This  is  due 
to  the  fact  that  the  controlled,  residual  ISI  present  at  the  sPRE 
output  allows  better  exploitation  of  the  multipath  diversity 


Fig.  4.  Simulated  frame  error  rate  comparison  for  1  transmit,  1  receive 
antenna. 


Fig.  5.  Simulated  frame  error  rate  comparison  for  2  transmit,  3  receive 
antenna.  Turbo  coded  LSTC  on  each  transmitter. 

inherent  in  ISI  channels.  The  DFE  on  the  other  hand  tries 
to  suppress  the  ISI  it  rather  than  exploit  it.  The  increased 
diversity  order  for  the  sPRE  can  thus  be  observed  by  noting 
the  rate  at  which  the  FER  falls  with  respect  to  the  SNR. 

Next  we  consider  the  MIMO  case  with  2  transmit  and  3 
receive  antennas.  At  the  transmitter  the  incoming  bit  stream 
is  multiplexed  on  to  the  two  transmitters  and  independently 
encoded  using  the  previously  described  turbo  code.  Figure 
5  shows  the  FER  performance  for  both  sPRE  and  DFE.  We 
observe  that  for  this  case,  in  the  high  SNR  regime,  there  is 
considerable  improvement  obtained  using  the  sPRE.  This  is 
because  for  a  2  by  3  system  there  is  large  diversity  order 
already  present  in  the  system.  The  sPRE  adds  to  that  by  ex¬ 
ploiting  an  additional  degree  of  freedom  (multipath  diversity) 
from  the  channel,  resulting  in  very  high  overall  diversity  order. 
This,  coupled  with  iterative  equalization  results  in  significant 
performance  improvement  over  DFE. 

B.  Experimental  Results  using  Real  Data 

In  this  sub-section,  results  are  presented  using  real  data 
collected  during  a  multi-institutional  experiment  conducted 
off  the  coast  of  Kauai,  Hawaii  in  September  2005.  For  this 
case  we  present  the  decoding  results  for  turbo  coded  SISO 
packets.  Figure  6  shows  the  convergence  behavior  of  the  DFE 
and  the  sPRE  in  the  high  frequency  regime  (near  50  kHz). 
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Fig.  6.  Turbo  equalizer  convergence  behavior  comparison  using  real  data 
in  the  low  SNR  regime  (  48  kHz  carrier)  for  different  training  lengths.  sPRE 
tap  position:  [1,6]. 

TABLE  I 

Decoding  result  using  real  data  for  turbo  coded  single 
TRANSMITTER  DATA  PACKET  TRANSMITTED  AT  40.5  KHZ  CARRIER. 
Multiple  numbers  in  a  column  indicates  multiple  turbo 
iterations. 


%  training 

DFE  #  bit  errors/4800 

sPRE  #  bit  errors/4800 

2  tap  TIR  placed  @[16] 

25 

219,  0 

0,  0 

20 

228,  0 

0,  0 

15 

691,  358,  0 

0,  0 

10 

510,  76,  0 

0,  0 

2 

failed,  50  %  errors 

868,  0 

1 

failed,  50  %  errors 

604,  0 

0.4 

failed,  50  %  errors 

774,  0 

The  operating  SNR  in  this  regime  is  significantly  low  due 
to  frequency  dependent  attenuation.  For  the  sPRE,  the  tap 
positions  are  determined  by  examining  the  channel  power- 
delay  profile  estimates  obtained  using  periodically  transmitted 
channel  probes.  The  performance  improvement  observed  by 
simulations  is  evident  with  real  data  also,  wherein  Figure  6 
shows  that  the  sPRE  converges  much  faster  (lower  decoding 
latency)  and  with  significantly  less  training  than  the  DFE 
(higher  overall  throughput). 

Next,  we  present  the  decoding  results  for  the  medium-to- 
low  SNR  regime  (near  40  kHz)  in  Table  I.  The  channel  profile 
for  this  case  exhibited  a  dominant  secondary  arrival  at  a  delay 
of  6  symbols  and  a  weaker  third  arrival  near  the  20th  symbol. 
The  sPRE  for  this  case  is  implemented  with  a  TIR  placed  at 
[1,6].  A  3-tap  TIR  placed  at  [1,6,  20]  was  also  tried  but  since 
it  did  not  provide  significant  gains  over  the  2-tap  TIR,  its 
results  are  omitted.  From  Table  I  we  observe  that  the  sPRE 
is  able  to  decode  successfully  with  considerably  less  training 
than  the  DFE.  Whereas  the  DFE  required  nearly  10%  training, 
the  sPRE  is  able  to  clear  the  packet  of  errors  even  at  0.4% 
training.  Moreover,  for  higher  training  lengths  (>  15%)  we 
observe  that  even  though  both  schemes  provide  zero  errors, 
the  sPRE  does  so  after  the  very  first  turbo  iteration,  while 
the  DFE  requires  to  iterate  two  or  three  times  before  it  can 
successfully  clear  all  the  errors. 


V.  Conclusions 

In  this  paper  we  have  proposed  an  adaptive  and  iterative 
MIMO  sparse  partial  response  equalization  (sPRE)  scheme 
for  shallow  water  UWA  channels,  along  with  a  jointly  op¬ 
timized  phase  tracking  loop.  The  proposed  scheme  exploits 
the  unique  long  and  sparse  structure  of  the  UWA  channel  and 
attempts  to  extract  multipath  diversity  by  retaining  controlled 
residual  ISI  at  the  output  of  the  equalizer.  The  residual  ISI 
structure  is  chosen  to  be  sparse,  to  approximately  coincide 
with  the  structure  of  the  original  channel.  The  proposed  struc¬ 
ture  is  also  designed  to  be  soft  input,  soft  output.  The  residual 
ISI  is  further  mitigated  by  using  a  low  complexity,  near 
optimal  detection  algorithm  known  as  the  “belief  propagation” 
(BP)  scheme.  The  sPRE+BP  scheme  is  followed  by  the  well 
known  turbo  equalization  block  in  order  to  reduce  the  error 
rates  gradually.  The  proposed  scheme  is  compared  with  the 
canonical  iterative  MIMO  DFE  via  simulations  and  also  using 
real  shallow  water  data.  It  is  found  that  for  channels  with 
long  and  sparse  structure,  the  sPRE  significantly  outperforms 
the  DFE  in  terms  of  error  rates.  Moreover,  it  is  shown  using 
real  data  that  the  sPRE  requires  much  fewer  training  symbols 
than  the  DFE  and  also  converges  much  faster  (fewer  turbo 
iterations)  compared  to  the  DFE,  making  it  an  attractive  low 
latency,  high  throughput  alternative  to  the  DFE  for  shallow 
water  acousticng  channels. 
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